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Abstract 

^ Estimation of the Embedding capacity is an important problem specifically in reversible 

O multi-pass watermarking and is required for analysis before any image can be watermarked. In 

^-H this paper, we propose an efficient method for estimating the embedding capacity of a given 

cover image under multi-pass embedding, without actually embedding the watermark. We 
demonstrate this for a class of reversible watermarking schemes which operate on a disjoint 
group of pixels, specifically for pixel pairs. The proposed algorithm iteratively updates the 
co-occurrence matrix at every stage, to estimate the multi-pass embedding capacity, and is 
much more efficient vis-a-vis actual watermarking. We also suggest an extremely efficient, pre- 
computable tree based implementation which is conceptually similar to the co-occurrence based 
^ method, but provides the estimates in a single iteration, requiring a complexity akin to that 

' of single pass capacity estimation. We also provide bounds on the embedding capacity. We 

finally show how our method can be easily used on a number of watermarking algorithms and 
' ' specifically evaluate the performance of our algorithms on the benchmark watermarking schemes 

^ of Tian [11] and Coltuc et al [6]. 

in 

lO 1 Introduction 

^-H Reversible Watermarking [4] is a technique used to preserve the copyright of digital data (image, 

^ audio and video) , while at the same time it ensures exact recoverability of the watermark as well as 

the cover image. This is mainly significant in apphcations concerning military and medical image 
processing, legal and multimedia archiving of valuable original works, etc. We briefly describe 
below some prominent schemes in reversible watermarking. 
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1.1 Past work 



There are many algorithms proposed for reversible watermarking, described comprehensively in 
the survey papers [18, 8]. There are four major techniques of embedding watermarks in reversible 
watermarking schemes, namely: histogram bin shifting, lossless data compression, expansion and 
mapping based techniques and prediction based techniques. Histogram bin shifting based tech- 
niques [21] suffer from the basic limitation of low embedding capacity, while data compression based 
techniques [16] mostly involve computationally expensive algorithms. There have been several al- 
gorithms proposed and implemented based on transforms on groups of pixels [1, 5, 6, 7, 11, 14, 22], 
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because of the basic advantage of high embedding capacity and a modest computation cost. Ma- 
jority of these techniques operate on a pair of pixels. The first of these was proposed by Tian [11] 
and subsequently extended by [1, 7, 14]. These techniques are location map based and hence re- 
quire an additional step of data compression. Recently reversible contrast mapping (RCM) based 
methods [2, 5, 6] have been used to efficiently embed data without using a location map. Prediction 
based techniques [7, 19] have also been suggested and they use information from the neighboring 
pixels to embed information. 

1.2 Multi-peiss capacity estimation 

Loosely the embedding capacity of an image can be described as the size of the largest watermark 
which can be embedded into that image. Each watermarking technique has a maximum possible 
embedding capacity over a single pass, and hence often it is necessary to go for multiple passes to 
embed a much larger watermark into the given image. Recall that multipass embedding involves at 
every stage, successively embedding the watermark bits into the already watermarked image from 
the previous stage. Consequently any watermarking application would require an estimation of the 
number of passes of watermarking possible as well as an analysis of the feasibility of inserting a 
watermark of specific length into a given image. For this purpose it is necessary to calculate the 
embedding capacity beforehand. In practical settings, it may be necessary to find such estimates 
repeatedly for different configurations of the watermark and the cover image. Hence it may not 
be feasible to actually embed the watermark in a given image, and to check if the watermark 
and the cover image are compatible for watermarking. Since most watermarking algorithms are 
quite slow, the possibility of having to check for multiple iterations of embedding before choosing 
the right watermark would make the task computationally quite demanding. In particular, these 
watermarking schemes tend to be quite complex and involve data-compression stages which makes 
the task of embedding computationally expensive. Hence we require efficient estimation algorithms 
for the computation of embedding capacity of different watermarking schemes. The problem of 
multi-pass embedding capacity estimation has not been studied much in the literature despite this 
being needed before any cover image or watermark could be selected for embedding. Although 
Kalker [12] talks about capacity bounds based on dirty water codes and Hamming codes, it mainly 
focuses on capacity bounds for an allowable control distortion. Li et al. [13] also talk about image 
independent embedding capacity, but the focus lies in finding the minimum possible embedding 
capacity for any image. Thus there is an urgent need to develop appropriate techniques to compute 
the embedding capacity in multi-pass watermarking schemes. For a given image, the single pass 
embedding capacity is simple to estimate and can be directly computed by considering pixel pairs 
eligible for watermarking. Multi-pass embedding capacity estimation is however challenging since 
the subsequent passes depend not just on the cover image but also on the watermarks embedded 
in the previous iterations. Hence in the rest of the paper, we focus on developing computationally 
efficient techniques to estimate the embedding capacity in multi-pass embedding. 
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Figure 1: A block diagram depicting the procedure for this class of watermarking schemes. Here 
P denotes the disjoint partitioning block, F denotes the Flag stream generator, L represents the 
location map bitstream generator, R is the reconstruction block, C represents the compression 
method and T denotes the transform block. 



2 Framework, Notation and Problem Definition 

2.1 Framework and Notation 

We show in this paper that for a select class of watermarking algorithms it is indeed possible to 
provide good estimates of the embedding capacity over multiple passes. This is a class of transforms 
which operate on independent groups or blocks of pixels, generally known as the expansion and 
mapping based algorithms. In this paper, we propose algorithms for pixel-pair based methods, 
since for majority of these techniques [2, 5, 7, 6, 11, 14, 22], the independent group or blocks of 
pixels are pixel pairs. In other words the entire image is partitioned into disjoint pairs of pixels. 
We further show that the proposed algorithms can also be extended easily to schemes [1] which 
operate on larger groups of pixels (for example pixel triplets and quadruplets). 

We now introduce the notation we will use throughout this paper. Let D represent the domain of 
the pixel pairs, i.e {D = [0, L] x [0, L]}, where L = 255 for an 8-bit Image. Further let ^ represent a 
pixel pair {x,y), where x and y refer to the pixel intensities. The procedure involved in pixel-pair 
based watermarking schemes is depicted in fig. 1. As illustrated in fig. 1, the input image is first 
partitioned through a disjoint partitioning block P, into a set of disjoint pixel pairs, represented 
as S = {^1,^2, • • • ,(,n}- These pixel pairs are generally adjacent to each other, either horizontally, 
vertically or diagonally. Let N be the total number of pixel pairs in the image. For ease of notation 
we will sometimes drop the subscript. We further represent ^' = {x',y'), as the transformed pixel 
pair, after watermarking the pixel pair = 

Recall that in order to prevent overfiow and underflow we have the constraints: < x' < L, < 
y' < L. Thus not every pixel pair is embeddable. Further in order to control the distortion, some 
additional restrictions are imposed on the embeddable domain [6, 11, 22]. Correspondingly only 
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those pixel pairs ^, such that: |x — y| < 9h for some threshold 9h are considered for embedding. 
Thus let the corresponding domain of cmbcddable pixel pairs be Dj C D. Let B be the set of 
all possible bitstreams, which are embedded into an image. In order to maintain reversibility 
of the watermarking algorithms, many times some additional data is required to be embedded 
along with the watermark bitstream [8]. This data is called the auxiliary data stream denoted 
hy A £ M, and hence the disjoint pixel pairs S are analyzed through an auxiliary data block A 
(shown as a densely dashed block in fig. 1), to construct the auxiliary data stream A, needed to 
be embedded along with the image. Thus within the embeddable pixel pairs, both the watermark 
and the auxiliary information have to be embedded. There are two types of auxiliary data used in 
watermarking algorithms belonging to this class. They are the flag bits and the location map. Thus 
the auxiliary block A consists of the flag bitstream generator F and the compressed location map 
stream generator L. Flag bits are typically required to be embedded along with the watermark 
to ensure reversibility. In most algorithms these are either in the form of LSB bits of some pixels 
which need to be stored, or flags containing information regarding some pixel pairs. We represent 
the domain of pixel pairs which contribute towards the flag bit stream as Ojr C D. In other words 
for every pixel pair belonging to Djr, we need to store bits either in the form of flags or LSB bits. 
Generally Bjr c B, as not every pixel pair requires a flag bit to be stored. Let be the flag 
contributed by the pixel pair ^. In most algorithms is a binary bit, and atmost one bit needs to 
be stored for every pixel pair S^. Further these bits need to be stored only for pixel pairs ^ G 
and hence for every ^ ^ Djr, = {</>}. Here (j) represents a nuUbit. Let further r/(/^) represent 
the number of bits contributed by ^. Hence r/(/^) = 0, if = {(j)}. We can then represent the 
flag bitstream as = F(H) = {/^i, /^2' • • • ' /?]v}- For those ^ Dj-, /^^ = {(f)} and they do 
not contribute towards J-". The location map on the other hand is a binary map which stores 
information for every pixel pair in the image. For example in Tian's scheme [11], the location map 
consists of information whether a pixel pair is expandable or not. Let denote the location map bit 
corresponding to the pixel pair ^. We denote C = L(H) = {l^^^,!^^, . . . , l^^} as the location map bit 
stream. Unlike the flagbit stream, the location map is required for every pixel pair in the image (i.e 
r/(/j) = 1, G D) and hence it cannot be directly embedded in the form of the auxiliary data since 
its size is the same as the total number of pixel pairs in the image. Thus it has to be compressed 
using a compressing method C. We then represent the compressed location map as C*^ = C{C). 
Sometimes the flag bit stream may also be compressed and we represent the compressed flag bit 
stream as = C(^). Hence the auxiliary data stream A = F Q C'~^ Q J^'^ . Here indicates the 
concatenation of bitstreams. Hence we have: r]{A) = r]{T) + r]{C^) + r]{T^). Note that one or 
more of these may be null streams, depending on the watermarking scheme used. As we shall see 
later, the sizes of the compressed location map and flag bits may depend on the number of ones and 
zeros in those streams. We assume uj{.) denotes the number of ones in a bitstream and hence uj{jC) 
and uj{T) denote the number of ones in the location map and flag bitstream respectively. Let 
be the region of pixel pairs where the location map bit is 1. In other words: = G D = 1}. 
Similarly we can define D}^ = G Bjr \f^ = 1} as the region of pixel pairs where the flag bit is 1 

We represent the watermark bitstream as G B. Correspondingly we can construct the total 
embeddable bitstream I = SqA, with r){I) = r){S) + r){A). Since we are interested in finding the 
embedding capacity, we assume that £ represents the largest possible bitstream, and correspond- 
ingly the embedded bits I, will be embedded in every embeddable pixel pair. Thus r]{I) is equal 
to the total number of embeddable pixel pairs in the image. Let denote the bit embedded in 
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the pixel pair ^. Further we assume r){i^) denotes the number of bits embedded in ^. Given the 
embcddablc bitstream I, we can then construct an ordered sequence of bits, to be embedded into 
the image such that: I = {i^i,«^2' • • • ''^^ivl with, i^. = (j), if ^ Di or ) = 0. 

Each pixel pair ^ = {x,y) is mapped to another pixel pair ^' = {x',y') through a transform: 
f = T{^,i^). Let T denote the transform block. Let ^° = T(^,0), = r(^, 1) and ^'f' = r(^,^). 
Hence ^' = {^^, ^.^jS,"^}- Thus we embed the bitstream I into the pixel pairs, H, to obtain the set of 
transformed pixel pairs: E' = T{E,I) = {r(^i, i^J, r(^2, ^ja)' • • • > ^(^Af> ^^iv)} = Ui> • • • j ^jvl- 
Finally the watermarked image can be obtained by reconstructing the image from the set of pixel 
pairs S', through a image reconstruction block R. This entire procedure is illustrated in fig.l. The 
maximum embedding capacity for these algorithms is 0.5 bpp since we can embed atmost a bit 
in each pixel pair. However the auxiliary data eats up this capacity. Hence 77(f) = r]{I) — r]{A) 
represents the size of the largest possible watermark embeddable in the image and is the embedding 
capacity of the image. 

In multipass embedding, the entire embedding stage depicted in fig. 1, is repeated at every 
stage on the watermarked image from the previous stage. Further note that throughout this 
paper we also assume that at every subsequent pass of embedding the bitstream will be embedded 
in the same set of disjoint pixel pairs, which were selected in the first pass of embedding. We 
represent the number of passes in multi-pass watermarking as P. In multipass embedding, let 
X};,Aki£k,C,k,^k^ etc. refer to the corresponding bitstream of the k^^ stage of watermarking. In 
other words, Xq = {i^o'^^i'"'' '^^jvl'-^i = {^^o'%'''' '^^at-^ represent the final 

concatenated bitstreams after P passes as X^A, £,T^C, etc. For example X = XqQXiQ ■ ■ -QXp^i. 
Also let p be the fraction of the number of ones in the embedded bitstream to the total size of the 
bitstream. 

In this paper, we provide a general framework of algorithms applicable to any watermarking 
scheme fitting in the above mentioned class. We do not confine our analysis to any particular 

watermarking scheme and try to keep our algorithms as general as possible. However we give some 
examples below of few watermarking schemes which fit into this framework and briefly describe the 
embedding regions and the auxiliary data required by them. 

• Tian [11]: This is the first paper on difference expansion. It embeds a watermark bit into the 
difference of pixels and requires a location map. Here the set of embeddable pixel pairs is 
exactly those pixels which are changeable. Further the auxiliary data comprises here of the 
compressed location map and flag bits in the form of LSB bits. The domains DxjBjtjD^ and 
B>jr for Tian's scheme are shown in fig. 2. Here we have = LSB(|a; — y\). 

• Thodi et al [7]: This is an extension of Tian's algorithm. It uses a combination of histogram 
bin shifting and difference expansion. The two main algorithms here are difference expansion 
with histogram shifting using overflow map (DE-HS-OM), and difference expansion with 
histogram expansion using flag bits (DE-HS-FB). The method is similar to Tian's but they 
select locations for embedding by defining non overlapping regions in the histogram of the 
expandable differences. The regions here are similar to those of Tian's and we do not show 
them separately. 

• Coltuc et al [6]: They use a reversible contrast mapping (ROM) method of embedding wa- 
termark using a simple integer transform on pairs of pixels. Again the domains Dj, Dj- and 
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Figure 2: Illustrations of the regions of Dx, Dj-, D^r, and D^, for the scheme of Tian [11]. The regions 
are plotted as 255 x 255 matrices and the white region represents the corresponding domains. Note 
that here is almost the entire region D. We assume 6^ = 255. 
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Figure 3: The regions in above two and the bottom-left figures represent Bx, Bjr and B^r respectively 

for Coltuc's scheme. The bottom left Image represents the region Dj for the scheme of Weng et al. 
The regions are plotted as 255 x 255 matrices, and the white region represents the corresponding 
domains. 



B^r for Coltuc's scheme are shown in fig. 5a, 5b and 3c. Further we have here: = LSB(a;). 

• Weng et al [22]: The concept of invariability of the sum of pixel pairs and pairwise difference 
adjustment (PDA) is exploited to embed a watermark in a pixel pair. This method requires 
location map compression. Again the different domains for the scheme of Weng are shown in 
fig. 3d. 

As it is clear from the above summary, majority of the techniques are location map based and 

require additional data compression. The compression of the location map significantly increases 
the complexity of the algorithms, further emphasizing the need to provide efficient multi-pass 
embedding capacity estimates as described in section 1. Each watermarking algorithm requires 
atleast two iterations over the entire image to first identify the embeddable regions and get the 
auxiliary information and then actually embedding the watermark, in addition to a possibly extra 
iteration to compress the location map. This is also evident from the watermarking procedure shown 
in fig. 1. In particular the location map compression is a computationally expensive task. Thus the 
watermarking schemes have a lot of associated overhead. The RCM based method however does 
not require any location map, and is comparatively the most efficient algorithm. We however show 
that our estimation method performs much better that even Coltuc's method computationally, 
while still providing reasonably precise estimates. 
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2.2 Problem Definition 



We assume that the probabihty that a given bit in the bitstream is 1 is p, and is known. We claim 
here that due to the block based approach of this class of watermarking schemes, the embedding 
capacity for a given image and watermark depends only on p and not on the actual bitstream 
itself. Infact we experimentally verify this in section 5.1. Hence we pose our problem as providing 
good estimates of the total embedding capacity r){S), in multi-pass watermarking schemes given p 
and the number of passes P. Estimating the embedding capacity r/(<?) requires estimation of total 
number of embeddable pixel pairs r){I), along with the auxiliary data size r]{A), required to ensure 
reversibility. 

2.3 Our Contributions 

We exploit specific properties of the pixel pair based watermarking schemes to efficiently esti- 
mate the multi-pass embedding capacity in computational costs significantly lower than actual 
watermarking. We first propose a co-occurrence based method which iteratively updates the co- 
occurrence matrix of the image to effectively estimate both the embedding information and the 
auxiliary data size required at every pass. Subsequently we propose a pre-computable tree based 
implementation which concisely represents the multi-pass structure for every pixel pair. We then 
prove the equivalence between these two methods. We then use these algorithms to estimate the 
embedding capacity of an image for a given value of p. Lastly we also propose methods to estimate 
bounds on the embedding capacity. Though this is mainly a theoretical paper, we perform a num- 
ber of experiments and evaluate our algorithms on the watermarking schemes of Tian and Coltuc 
et al. We show that these estimates are reasonably close to the actual capacities of these images. 

3 Embedding capacity estimation 

In this section we present methods to estimate the embedding capacities, given the bitstream 
distribution [p). The pixel pair tree method though a very fast estimation procedure, requires 
an offline stage and some additional memory. The co-occurrence matrix method is, however, an 
iterative method, but is amenable to considering different probabilities at every iteration. 

3.1 Proposed Algorithms 

Let i3 G B represent a general bitstream, to be embedded into an image. It could represent the 
embedded bitstream (X), watermark flag bit stream {T) or the location map bit stream (£). 
There are typically two problems of interest related to this. One is estimating the size of the 
bitstream 'q{B), like estimating the number of embeddable pixel pairs ■q{Z) or the size of the flag bit 
stream r]{J-). The other is estimating the number of ones in the bitstream a;(i3), which is relevant 
in estimating the compression of the bitstream, as we shall see later. For example, to estimate 
the size of the compressed streams r]{C^) or ■q{T'~^) we would need to estimate the number of ones 
in the corresponding streams, ijj{F) and oj{C), respectively. Also we assume that denotes the 
bitstream corresponding to the k*^ stage of embedding. Let represent the bits contributed by 
the pixel pair ^, and r]{h^) represent the number of bits contributed by the pixel pair ^ towards the 
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bitstream B in a single pass. Since we are concerned mainly with pixel pair based watermarking 
schemes, is a bit and r]{b^ is cither or 1, depending whether ^ contributes towards B or not. 
Again for example,when B isI,J^ and C, b^, is i^, and respectively. Further, represents 
the region of pixel pairs which contribute towards the bitstream and represents the pixel pairs 
where b^ = 1. In other words Be = G B \b^ ^ (/)} and B^ = G Bg \b^ = 1}. 

3.1.1 Co-occurrence Matrix based method 

Definition 1. (Pair-wise co-occurrence Matrix). We define the pairwise co-occurrence matrix C 
of size Lx L, similar to the conventional co-occurrence matrix [10], as the population (distribution) 
of co-occurring pixel pairs in an Image. In our context, it represents a count of the number of times 
a pixel pair occurs in the image. Thus given the disjoint pairs of pixels S = {^1,^2, ■ ■ • ,Cn}, we 
can define it as: 

N 

^iO — ''^^^(^ ~ ^j) where I(.) is the indicator function. (1) 

i=i 

In this paper all subsequent usage of the term co-occurrence matrix woTild actually mean a 
pairwise co-occurrence matrix. We now provide a scheme to iteratively update the co-occurrence 
matrix and estimate the size of the bitstream at a given stage using the corresponding co-occurrence 
matrix at that stage. We start with a co-occurrence matrix Co initially, calculated directly from the 
cover image. The initial image and its corresponding co-occurrence matrix represent the O*'' stage. 
We then iteratively update the co-occurrence matrix at every stage (pass of embedding) using the 
following scheme: Let the co-occurrence matrix at the k^^ stage be Cfc. Then for every pixel pair 
C € P fraction of the total number of these pairs of Cfc(^) will become while 1 — p oi these 
will transform to in C^+i. For those pixel pairs not embeddable, ^ is transformed to This is 
elaborated in detail in Algorithm 1. 

Algorithm 1 Statistical estimation of the embedding capacity from the co-occurrence matrix for 
a given probability of watermark p and number of passes P. 

Find the pair- wise co-occurrence matrix Co from the given image using Equation (1). 

A; = 0. 

repeat 

Set all entries of Ck+i to 0. 

for ^ G Di do 

Ck+iie) ^ Ck+ii^) + {i-p)Ck{0- 

end for 

for ^ ^ Dx do 

Cfc+i(e<^)^Cfc+i(e<^) + Cfc(o. 

end for 

k ^ k-\-l 
until k < P 
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The co-occurrence matrix at the k^^ stage can be used to estimate the embedding capacity for 
the {k + 1)*'*^ pass. For example the embedding capacity of the first pass can be estimated from the 
initial co-occurrence matrix Cq. Thus we can write: 

ri{Bk) = ^CkiOv{h) = E ^kiO- (2) 

It is clear that for a P pass watermarking it is sufficient to estimate the co-occurrence matrix upto 
the P - l^^ stage. Thus the total size of the bitstream r]{B) can be estimated as: 

p-1 p-1 

k=0 k=0 ^eDe 

We can similarly find the number of ones in the bitstream u{Bk), by replacing r]{h^) by 6^. Thus 
we have: 

u^{Bk) = Y,cmh= E ^'^(^)- (4) 

In typical images the co-occurrence matrix is diagonally dominant with only about 10 % of the 
entries non-diagonal. Typically the disjoint set of pixel pairs are chosen as the neighbouring pairs 
of pixels and hence both the pixels are very similar in magnitude and correspondingly the possible 
set of pixel pairs is actually a small fraction of the total size. Hence one may use the standard 
sparse matrix representations [9, 17] to make our algorithms computationally more efficient. 

Note that the co-occurrence based method is also an iterative procedure similar to any wa- 
termarking scheme. It is however computationally much more efficient than any stage- wise wa- 
termarking scheme for the following reasons: 1) The size of the co-occurrence matrix (256 x 256) 
is smaller than that of any typical image and hence the computation is much lesser. 2) A large 
overhead is associated with actual embedding due to the many iterations involved in calculating 
and embedding the auxiliary data, along with the required compression algorithm. 3) The sparse 
matrix representation of the co-occurrence matrix further improves the computation. In fact it is 
many times faster than actual embedding, as we show in the timing analysis in the results. 

3.1.2 Pixel Pair Tree based method 

Definition 2. (Pixel-pair tree). The pixel-pair tree for a pair ^ is defined as a tree which starts 
with the pixel pair ^ and traces a specific path based on the subsequent embedded bits as this pixel 
pair evolves and represents all feasible paths of watermarking for a given number of passes. 

A pixel pair tree for a pixel pair (10, 12) embedded using the watermarking scheme of Coltuc [6], 
is shown in fig. 4. This tree concisely represents the entire life cycle of a particular pixel pair upto P 
passes. We begin by first defining the notation we will use for the pixel pair tree. Let represent 
the set of all paths for a given pixel pair. Note that we are assuming here that this is a P— stage 
watermarking. For example in fig. 4 the possible paths for the pair (10,12) upto P = 4 passes are 
{(10,12), (9,14), (5,18), (4,18), (4,18)}, {(10,12), (9,14), (5,19), (4,19), ((4,19)} and so on. Let s 
denote a particular path in and let s[k] denote the pixel pair at the k^^ stage of embedding in the 
path s. Again, consider the path s = {(10, 12), (9, 14), (5, 18), (4, 18), (4, 18)}, then we have s[0] as 
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Figure 4: An illustration of the pixel-pair tree for the pixel pair (10, 12) for Coltuc's method [6]. A 
similar tree can be constructed for any block based watermarking scheme. 

the pixel pair (10,12), s[l] as (9,14), s[2] as (5,18) and so on. Similarly, ig[o], is[i] and i^p] are 0,0 and 
(j) respectively and f?(«s[o]), ??(^s[i]) and ^/(i^p]) are 1,1 and respectively. Further let sj denote a path 
in the pixel pair tree S^. (the pixel pair tree for the pair ^j). Then we define s = {si, S2, • • • , sat} as 
a particular path configuration chosen for the pixel pairs H = {^1,^2, • • • ,Cn} and let S represent 
the set of all possible path configurations s. Let ps denote the path specific probability of the path 

s. It represents the probability that the pixel pair will evolve through the specific path s. Then 

p-i 

clearly, we can write Ps = Ps[k]y where Ps[k] denotes the probability of transition from s[k] to 

A:=0 

s[k + 1] given whether s[k] is embeddable. Thus 

r p, if = 1 & s[k] e Ox 

Ps[k] = < 1 - P, if is[k] =0 k s[k]£Bx (5) 
[ 1, if s[k] ^ Di 

Let r/(s, i3) represent the size of the bitstream obtained through a specific path configuration s = 

{si, S2-, - ■ ■ , sn}- Further let r/(^, s, B) denote the total number of bits contributed by a specific pixel 

p-i N 
pair ^ through the path s. In other words 7]{S,,s,B) = Yl vi^slk]) and rj{s,B) = Yl ■^j, ;B). 

k=0 j=i 
The estimate of the length of the bitstream B can then be given as ri{B) = Es6§(77(s, i3)), where 
Esgg(r/(s, B)) represents the total expected size of B by considering every possible path configuration 
si £ S^^ , S2 G , • • • , Sat G S^^ , which we can also write as s G §. We then write this expectations 
Esgg(?7(s, S)), as the sum of the expected number of bits (Esgs^(T7(^, s,B))) contributed by every 
pixel pair ^. Since we have considered here a P stage watermarking, for simplicity of notation, we 
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represent this expectation as Ep(^,H). Thus we can write: 

N 

N N 

i=i sjeSj^, ^ fc=o ^ fc=o 

In a similar manner we can also find the size of the bitstrcam rj{Bk) as an expectation over every 
possible path in the pixel pair tree. We use the notations similar to our above derivations, with 
each quantity now representing its stage-wise equivalent. However note that here ri{^,s,Bk)) = 
rf{bs[k])- Further since we are interested in estimating the size of Bk, the pixel pair tree extends 

k 

only till the A;*'* stage and hence = H Pslm]- Again for convenience we use E^(^, Bk) instead of 

m=0 

EseS{(^(C) s, Bk)) and we can write: 

N 

r]{Bk) = Bs&ivis, Bk)) = ^Bsj(.s^.ivi^j,Sj,Bk)) 

N N 

= ^E^{^j,Bk) = Y, E v{^j^^J^^k)Ps, 

= E E 'l(bsAk])[ HPsAm]] (7) 
j=l SjeS^. ^ m=0 ^ 

Thus we summarize the expressions for Ep(^,B) and E''(^,Hfe) as: 

P-l X p-1 



E^(e,H)= e( n^'^ilE^^^^w) (8) 

seSj ^ fe=o ' k=o 
B\C,Bk)=Y^v{bs[k])( llP4m]) (9) 

sGSj ^ m=0 ^ 

We can then find the number of ones in the bitstreams Bk and B. This problem is very similar to 
that of finding the size of the bitstreams, since now instead of summing over r]{bg[k] we need to sum 
over the actual bits contributed bg^k]- Hence we do not again reformulate the problem, but use the 
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same procedure above, by just replacing by fe^jj.] and give the final results as: 

AT AT p-i p-i 

j=i j=i Sj-eSj^. fe=o fe=o 

N N k 

j=l j=l Sj-eSg^. m=0 

As evident from above, the computation of the number of ones of a bitstream is exactly the same 
as that of computing the size of that bitstream except for replacing by bg^i^j. In the rest 

of the analysis we provide some important theorems and properties related to the total size of the 
bitstreams as well as the size of the bitstreams at every stage. These properties however also hold 
for the number of ones in the these bitstreams. 

• It is easy to show that: E^(^,i3) = Ylk=o ^"^(^^^k)- 

• The formulations above arc in terms of the image pixel pair (equations (7) and (6)), and we 
call them the image pixel-pair based formulation. We can alternatively also reformulate them 
in terms of the co-occurrence matrix. Prom the definition of the co-occurrence matrix, it is 
clear that we can write- 

r?(Hfe) = 5]Co(OE''(C,Hfc) (11) 
r/(5) = 5^Co(OE^(C,^) (12) 



• The co-occurrence based algorithm and the tree based algorithm provide exactly the same 
estimates. In other words, the estimates provided by equations (2) and (3) are exactly the 
same as the ones provided in equation (7) and (6). We prove this formally in theorem- 3 
(provided in Appendix- A). 

• The formulation of Ep(^, B) and E''(^,Hfc) requires an exhaustive search over every possible 
path making them exponential. Hence we need to provide a more efficient estimation mecha- 
nism. We observe that there exists a recursive relationship, which we describe in the following 
theorem. For simplicity we represent ri{bg^) as 77(6^) since s[0] = ^. 

Theorem 1. Equation (9) can be reformulated in a recursive manner (for k > 0) as: 
T,m.K>^S (l-p)E''(eO,Hfe_i)+pE^(C\Bfe_i), if^eBx 

with the base case of: E^{^,Bo) = r]{b^). Equation (8) can also be reformulated (for P > 1) as: 

rV^B)-S ri{bi)+PK-iie,B) + il-p)El,{e,B), if^eB^ , 
^P(^'^) - 1 r,{b^) + Bl_,{^t',B), ifU^x 

with the base case: E'^i{^,B) = 'r]{b^). 
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Proof. The base cases can be derived from equations (9) and (8). Wc first prove it for the case 
of the stage-wise expectation. Then we consider for k > 0, the case where ^ is cmbcddablc. We 
define S^o and S^i as the set of possible paths for the pixel pairs and respectively in P — 1 
passes. Also let s'^, represent a possible path in S^o and S^i, respectively. The main idea here is 
that we break up the set of paths into 2 groups, one of which starts with ^ and consists of the 
paths in S^o, while the other also starts with ^ but consists of those in S^i. Then we can rewrite 
equation (9) as: 

k-1 k-1 
E''(^,.Bfe)= Yl PYlPs^m]Vibs^[k-l])+ XI (^-P)'[lPsO[m]VibsO[k-l]) (15) 

Note that the set of pixel pairs Vs G S^, s[k] are captured fully by the pixel pairs Vs^ G S^i ,s^[k — 1] 
and Vs° G S^o,s'^[/s — 1]. Also recognize that within each summation, we have E''(^°, i3fc_i) and 
E^(^^,Hfe_i) embedded and hence we can rewrite equation (15) as: 

E'?(e,^fe) =pE''(e°,^fc_i) + {i-p)E^{e,Bk-i) 

We can similarly prove this for the case when ^ is not embeddable. 

We now consider the case for the total expectation of a given pixel pair. Again we use the same 
idea as above and rewrite equation (8) as: 

P-2 P-2 P-2 P-2 

Ep(C,^)= X p]lPs^k]{J2'i(bsi[k]) + vik))+ J2 i^-p)IlPs"[k]{Y.'^(^soik]) + v{k)) 

s^eS^i k=0 k=0 s°eS^o fe=o fe=o 

We observe that r]{b^) occurs in every term. Again we recognize that within each of the summations 
are E^.i(^°,B) and El_^{^^,B). Thus we can write: 

Bl{^,B) = {l-p)Bl,{e,B)+pBl_,ie,B)+vih)(p E IlPs^k] + {i-p) E IlPsO[ki)ii(i) 

Since the paths in S^o and S^i form a complete subtree, X^^igs^i 11^=0^ ^'^sM*^] ~ ^ 

SsOgs nr=o^^'s°[fc] ~ I- Thus we observe that equation (16) transforms to equation (14) for the 

case when ^ is embeddable. We can similarly handle the case when ^ is not embeddable. □ 

Note that we can similarly reformulate E'^(^,i3;i.) and Ep(^,;B) recursively and the expression 
is similar to (13) and (14) except for replacing 77(6^) by 6^. 

It is clear from equation (14) that Ep(^, i3) or the total expected number of bits for a given pixel 
pair is a polynomial in p. Hence in order to make the offline stage independent of p, we compute 
the polynomial coefficients of Ep(^,;S) for every pixel pair and store them. Thus we can directly 
use these stored coefficients to compute Ep(^, H) for any given p without having to recompute them 
every time. In addition the set of coefficients is typically iteratively computed for many values of 
P. Hence we can use a simple 'mcmoization' [15] while computing the total expectation for a given 
pixel pair E^ and using the previously computed values of Ep -^, in equation (14). Thus the tree 
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formulation of equation (8) can be computed in linear time using these simple tricks. The same 
idea can be extended for computing the stage-wise expectation. 

Thus there are two stages of computation, i.e the online stage and offline. The offline stage is 
image independent and we compute the coefficients of the polynomials, described in equation (14) 
and equation (13). These can be iteratively computed for various values of P in linear time. The 
online stage consists of just a single iteration to use the values of Ep(^,;B) or 'E^{(,Bk) depending 
on what has to be estimated. 

Thus this method provides an extremely efficient implementation to estimate both the size and 
the number of ones in a bitstream. Importantly it can compute this in a single iteration, although 
the embedding is multi-pass. Correspondingly, we can find the multi-pass embedding capacity, in 
a time complexity comparable to that of single pass estimation. Thus it is more efficient than the 
co-occurrence based estimation framework. The only drawback is that it requires an offiine stage 
and additional memory to store the total or the stage- wise expectation for every possible pixel pair. 



3.2 Compressed bit-streams 

Compressed bitstreams sometimes occur as auxiliary data, either as a compressed location map 
or compressed flag bits. Correspondingly it is necessary to estimate the size of these compressed 
streams. Let be the bitstream obtained by compressing B. Further let C/(.) represent the 
compression factor (a number between and 1) for a bitstream and a given compression scheme. 
We can then find the size of the compressed bitstream as: 

p-i 

viB^) = ^vmCf{Bk) (17) 

fe=0 

However, we cannot find the exact bitstreams at every pass Bk, just using the bit probability, 
since they depend on the exact watermark sequence. Further we would have to actually embed 
the watermark in order to find the transformed pixel pairs and hence our statistical framework 
cannot be extended to this. However the Kolmogorov complexity of a binary bitstream of length 
n, with r number of ones has been shown [3] to be bounded by nHo{^), where -ffo(-) refers to the 
entropy [20] of a bitstream. Correspondingly the compression of these bitstreams, can be estimated 
by just finding the number of ones oj{B}^) in these bitstreams at every stage. These estimates 
though approximate are still quite good and useful in practice. In the last section we proposed 
algorithms to estimate oj{Bk). Hence we can estimate the size of the compressed bitstream at every 
stage r]{B^) as well as the total compressed bitstream size r]{B'^) as: 

r^{B^,) = r^{B,)Ho(^) (18) 

,(B^) = x;\(^.)^o("§|). (19) 

fe=0 '^^ 

3.3 Embedding Capacity Estimation 

Using the tools we have developed in the earlier sections, we are now in a position to estimate the 
embedding capacity. In particular, we use the co-occurrence or the tree based algorithm to find the 
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total number of embeddable pixel pairs and the auxiliary data size. Hence we can find the 

total and the stage-wise embedding capacity as: 



r,{Sk) = viW - vm - r?(J-F) - r?(£^) (20) 

V{£) = r/(X) - r^{T) - r?(J^C) - (21) 

Note that generally all these types of auxiliary data will not occur together in any watermarking 
scheme and hence one or more of these will be null streams. We give below estimates of each of 
ri{Ik)iil{^k):V{^k) ^^'^ vi^k) terms of the co-occurrence and tree based algorithms. 

N 

V{lk) = E ^"^0 = E ^'i^j^W (22) 

^6Bx j=l 

N 

r){J^k) = E ^kiO = ^B^{^j,J^k) (23) 

i&Djr j = l 

Further we can find the estimates of the number of ones in T and C as: 

AT 

u;{J^k) = E ^^(0 = EE"(e,-,^fe) (24) 

N 

u;{£k) = E ^kiO = J2^''i^j,^k) (25) 
Finally we can find r]{F^) and r]{jC,^) as: 

Vi^^) = VmHoi"^), viC^) = NHoi"^) (26) 

We have r){jC,k) = -^i since the size of the location map is exactly that of the number of pixel pairs. 
We can then find rj{I),rj{T),ri{T'^) and rj{C^) by summing each of the above obtained expressions 
from = 1 to P — 1. The tree based estimates for ri{X) and ri{F) can directly be obtained however, 
by just replacing the stage- wise expectation E'' by the total expectation Ep. 



3.4 Change in probability of ones due to auxiliciry data stream 

One subtle point worth mentioning here is that the embedded bitstream contains both the auxiliary 
data as well as the watermark. We have until now represented p as the probability of the bit '1' 
in the embedded bit stream. Let py^r represent the probability of the bit '1' in the watermark. 
Then p ~ py/ since the watermark is the major component of the embedded bitstream atleast 
for the initial passes. However, since a few watermarking schemes (for example Coltuc's method) 
significantly depend on the probability of the embedded bits, it may be necessary to find at every 
stage the probability of the bit '1' in the embedded bitstream. For this we need to estimate the 
probability of '1' in the auxiliary bitstream, which we denote by pA- In order to estimate pA, we 
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need to estimate the number of ones in the auxihary data stream. In the case where the auxiliary 
data is represented as compressed bit streams, wc can assume that the compressed bitstream will 
be random, and the number of ones and zeros are the same. Hence we can assume the probability 
of the bits contributed by these compressed bitstreams is 0.5. The number of ones in the flag bit 
stream can be computed, as co{J^k), and hence the probability pA can be found as: 

„ _ o^(J-fc) + 0.5(r?(/:Gfc) + r?(7-Gfc)) 

We can then easily modify algorithm- 1, and use p = pk at every stage of the algorithm, where p^ 
can be defined as: 

Here A contains combinations from the flag bit stream, compressed location map and compressed 
flag stream. We call this method the Co-occurrence based adaptive probability (CAP) algorithm. 
Using the co-occurrence matrix at the k*^ stage Ck, we first estimate r]{Ik) and r]{Ak) at every 
iteration using techniques discussed in section 3.3. These estimates are then used in equations (27), 
(28) to update the probabilities pk- Finally using the updated probabilities pk and methods given 
in algorithm-I, we estimate Ck+i- Thus we can modify the iterative co-occurrence based method 
to consider weighted probabilities at every iteration. The tree based implementation, however, 
inherently considers a single probability and is not amenable to a change in probability at every 
iteration and hence cannot be used to provide accurate estimates. However this change is necessary 
only for those schemes which significantly depend on the probability of the watermark. Further 
even for such schemes, the estimates obtained by considering only the watermark probabilities are 
observed to be practically very useful in most cases, without the need to consider the updated 
probabilities pk- We show in results that the error magnitude is quite small and we achieve descent 
approximations by only considering p pw- Thus unless extremely accurate estimates are required, 
we may continue to use the tree based algorithm in section 3.1.2 to achieve quick estimates. 



3.5 Estimating the optimal number of peisses 

The methods we have discussed upto this point have concentrated on providing estimates of the 
embedding capacity for a given number of passes P. We can however find the overall embedding 
capacity and the optimal number of passes as well. For typical watermarking schemes the stage- 
wise embedding capacity keeps on decreasing. This is due to an increase in the number of pixel 
pairs not eligible for embedding at every successive pass of watermarking. Further at every stage 
the overhead in the form of the size of the auxiliary data also generally increases. Thus the optimal 
number of passes is reached when the stage-wise embedding capacity £k touches zero. 



3.6 Generalization to watermcirking schemes operating on groups of pixels 

We have considered upto now watermarking schemes operating on pairs of pixels only, and hence 

we have assumed ^ = (x, y) in our algorithms. However our methods can be extended to consider 
n— tuple of pixels ^ = (x^, x^, • • • , x"^). Consider the case of n = 3 or triplets of pixels [1]. Here in 
every triplet 2 bits can be embedded, so we need to consider 4 cases of embedding 00,01, 10 and 
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11. Let the transformed pixel triplets be C , C and ^ . If a pixel triplet is not embeddable 

it will transform to . Then we can easily extend the co-occurrence based method, by considering 
here that (1 — J')^,p(l — p),p(l — p) and fraction of the pixel triplets of Cfc(.^) transform to 
Ck+i{i^^),Ck+i{i^^)jCk+i{£,^^) and Ck+i{£}^) respectively. The tree based method can similarly 
be estimated to consider a tree of groups of pixels instead of pixel pairs. The tree will have more 
branches. For example in the case of pixel triplets, each embeddable pixel triplet will have 4 
possible paths to C"'^) ■^'^^j C^'^ respectively. We can however define a recursive formulation 

similar to equation (14) only replacing the equation for the embeddable case as: Ep(^, B) = ri{h^) + 
p2E^.,(Cii,B) +p(l -p)(E^.l(e°^B) + E^.,(^io,B)) + {l-pf^l_^{e^B). Our methods can 
similarly be extended to deal with any arbitrary n, where-in every embeddable pixel n— tuple can 
get converted to 2"~^ possible pixel ra— tuples since within every such tuple n — 1 bits can be 
embedded. 



4 Bounds on the Embedding Capacity 

In this section we provide the bounds on the embedding capacity. The maximum embedding ca- 
pacity of a given image can be thought of as the largest possible watermark which can be embedded 
into that image. In other words no watermark with a size is larger than this, can be embedded 
completely into that image. We use the same tree based implementation discussed in section 3.1.2 
to find the maximum possible embedding capacity, by considering every possible type of watermark. 
We denote the largest size as rf^^^{.). Then using the notation developed in section 3.1.2, we can 
write: 



7?(X, s) - rj{T, s) - r]{C^, s) - ri{J^ , s) 



(29) 



Recall that s G S here represents s\ G S^^ , S2 G > ' ' ' > ■Siv £ • Note that though not shown, 
rf^^^{£) is a function of P, and it represents the maximum possible embedding capacity obtained 
in P passes. In order to find the maximum possible embedding capacity in the image, we need to 
find m^x rf^^^^{£). As discussed in section 3.5, the total embedding capacity initially increases till 

the optimal number of passes after which it begins to decrease. Hence we can simply start from 
P = 0, till the value of P where rf^^{£) begins to decrease. The corresponding value of rf^^{£) 
will be the maximum possible embedding capacity for that image. 

Theorem 2. The maximum possible embedding capacity rf^^{£) upto P passes can be upper 
bounded by: 

N 

N I'-l . Z]^'^(0''^fc)s P-l . N . 

rr^{£) < ^(M^^,-,!) - M^(e,-,^)) - ^ ^^o('^^^^^ - ^ L''(e,-,^fe) ) min(afe, ft) (30) 

with: 

N N 

<^k = Hj- ), /3k = Hol- ) (31) 



18 



and for a general bitstream B: 



p-i 



Ml{^,B) = max V r?(6,[fe]), M-(e,Hfe) = max b. 



fc=0 



:[k] 



L^S^Bk) = mill ri{bs[k]), l-'^i^, Bk) = min b^ik]- 

Further the bitstream Tk is formed by inverting every bit in Tk ■ 

Proof. We start with equation (29) and slowly relax the constraints as follows: 

s) - riiT, s) - riiC'^,s) - ri{J^, s) 



(32) 

(33) 



< max 

sgS 



ri{I, s) - ri{J^, s) 



min?7(>C , s) — min?7(^ , s). 



We now consider the first two terms in the above expression. We can expand it as follows: 

N 



max 
ses 

N 

Emax 

j=l ^ h 

N 

j;(M^(C,-,X)-M?,(e,-,^)) 



max > 



77 (X, s) - r]{F, s 



N 



■P-1 



P-l 



, = 1 «J \-k=0 k=0 



(34) 



Now we consider the term involving the compressed location map. Since this term appears with 
a negative sign, finding an upper bound on 77™^^ (£^) is equivalent to finding a lower bound of this 
term. Hence we first replace the compression term by the entropy of the location map bitstream at 
every stage, which is a lower bound to the size of the compressed bitstream. Hence we can write: 

P-l . P-l v^iV , 

minr?(/:C,s) > min V NHo{^^%^) > min V NHo r'=™ ) 



k=0 



k=0 



p-l \-^N J 

> VminiVgo( ^^=:>''' )- 



fe=0 



AT 



We then further simplify the above expression, using the lemma 1 (Provided in Appendix-B). 
Observe that Ck is typically dominated by ones initially since most of the pixel pairs are cmbcddable. 
Further compression is only possible till w(i2fc,s) > N/2 since, as the fraction of the number of 
ones reaches 0.5, the compression factor reaches 1 and hence embedding is no longer possible. Thus 
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"^^^'^"^ > 0.5, VA; and hence using lemma 1 directly with /(s) = "^^^^'^^ we have: 

P— 1 s-^N 1 P— 1 v~^^ ; 

, " (35, 

fe=0 

We now finally consider the term involving the compressed flag bitstream. Again similar to the 
compressed location map size estimation, we can write: 

P—I 

minr?(^C^,) > min ^ ,7(^fe, s)iIo('^^^) 

P—1 

We now consider each of the terms above separately. Consider the first term in the above expression. 

AT 

min77(J"fe,s) = min^riif^.^k]) 

N N 



^ min r/(4.[,]) = J] L''(C,-, J^,) (36) 



j=l ' j=l 

Now we consider the second term. Again we take the min term inside the entropy. However unlike 
the case of the compressed location map, we cannot comment on the value of whether it 

lies within [0.5, 1] or not. Hence we cannot use lemma- 1 directly. However we use lemma 2 (Again 
provided in Appendix B), and define h{s) = ^ j to obtain: 

mmHo[—— — -) > mm Ho{max—— -),Ho[mm—— [6^) 

Consider the first term within the entropy. We use a trick here of defining fg^f.j = 1 — fg^^, if 
s[k] G Hj- and otherwise. Then observe that r/(/s^[jt]) = fsj[k] + fsj[k]- Thus we can write: 
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/ a;(J"fc,s)\ ^ ( T,f=ifs,[k] \ rj f 1 
Hq max — ^— — f = i^Q max — ^ ]= HqI max — — = 

= v-JV F )>-f^o( ^ ^iv F 



j=l Is J [k] maxsgs 2^j=i /sj [k] 

- ^°(" Ef=imin,,es,. )- ^°( ~Ef=iL"fe,^fe) ) ^^^^ 

Similarly consider the second term and using derivations similar to above, we can obtain: 

Ho f min > ^0 ( ^) (39) 

Ef=iL'^fe,^fc) 

Thus the theorem is proved by combining all the above equations. □ 

Estimation of the maximum embedding capacity, requires estimation of the quantities in equa- 
tion (32). They however require exhaustive search over the pixel pair tree for every pixel pair, due 
to which the computation becomes extremely expensive. We can however recursively reformulate 
these expressions, similar to equations (13) and (14). Thus we can recursively compute M^(^, B) 
for P > 1 as: 

The base case here is M^(^,H) = r){b^). Similarly we reformulate the expression for M.^{^,Bk) for 
k > as 

max(M-(eO,Bfe_i),M-(^i,Sfc_i)), if ^ G 



M^(c,^.)^iM^(kB:_;); ^ ^h^^, ^''^ 

Again the base case here is M.^{^,Bo) = b^. 

Note that the expression for Li"(^,Bk) is similar to equation (41) only replacing the max(.) 
by min(.). The above expressions can easily be proved using methods similar to those used in 
theorem-1. Also we can use tricks like memoization etc. similar to those discussed in section-3.1.2, 
to efficiently estimate the maximum embedding capacity. 



5 Results and Discussion 

The presented concept has been experimented on several commonly known images as possible cover 
images. Images were chosen with varying gray level distributions. We however present results here 
only for Lena and Boat images for brevity (shown in fig. 5). A general analysis for several other 
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(a) (b) 

Figure 5: Above are the Boat (left) and Lena (right) images on which we have performed our 
analysis. 

Table 1: The standard deviation of the total embedding capacity at the end of various watermarking 
stages (in 10^^ bpp). 



Algo 


Stage- 1 


Stage- 2 


Stage-3 


Coltuc 


0.3 


1.1 


1.6 


Tian 


0.3 


1.3 


1.5 



images has been provided as additional material . We compute the embedding capacity for these 
cover images for watermarks of varying number of ones and zeros and compare them to the actual 
capacity as obtained through direct embedding of these watermarks in the corresponding cover 
image. We also find the bounds on the embedding capacity for these images. Lastly the codes 
used in the quantitative evaluation of our results shall be made available once our paper has been 
accepted. 

5.1 Dependance of Embedding capacity on probability of watermark alone 

In this section, we statistically verify our claim, that the multipass embedding capacity of pixel- 
pair based watermarking schemes depends only on the probability mass function of the embedding 
bitstream and not on the actual bitstream itself. In particular, we verify this for the Lena image, for 
the schemes of Coltuc [6] and Tian [11], and observe that the variance of the embedding capacity 
obtained by taking 100 different watermarks with p = 0.6 is negligible. The values of the standard 
deviations of the bpp's at various stages of watermarking are shown in Table I. 

5.2 Estimation and bounds of embedding capacity 

We demonstrate in this section, how the embedding capacity estimates obtained using our algo- 
rithms, are reasonably close to those obtained by actual embedding a watermark, for the schemes 
of Tian [11] and Coltuc [6]. In particular we compare the capacities obtained by actually wa- 
termarking (AW) with the estimates obtained by the tree based algorithm (TA), along with the 
Co-occurrence matrix based adaptive probability algorithm (CAP) discussed in section 3.4. Finally 

^Additional results are provided in ' 'http://www.ee.iitb.ac.iii/~sc/main/misc/results.html' ' 
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0.1 O.Z 0.3 0.4 0.5 0.6 0.7 0.0 0.! 

Proahabilitj of watermark bit being 1 (p J 



Figure 6: The capacity estimation for Lena image obtained through Tian's and Coltuc's schemes. 
For each of these watermarking schemes, the results of AW, CAP, TA and Maxcap are shown. 
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Figure 7: The results for Boat image obtained through Tian's and Coltuc's schemes. 

we also compute the maximum possible embedding capacity (MaxCap), discussed in section IV. 
The CAP algorithm is precise since it considers weighted probabilities discussed in section 3.4, while 
TA, though not as accurate is extremely fast . We have computed here the optimal embedding 
capacity, as discussed in section 3.5. Lastly observe that we compute these for various watermarks, 
characterized by the fraction of the number of ones in the watermark {pw)-, and are plotted in fig. 6 
and fig. 7. 

5.2.1 Coltuc et aL 

Coltuc's algorithm does not require any location map, and requires flag bits to be embedded along 
with the watermark. Hence the estimates provided are very precise and accurate. In particular, the 
estimates provided by the CAP algorithm are extremely accurate and consistent with the actual 
capacity for all probabilities. Further the estimates provided by the tree based algorithm (TA) 
assuming p = pw, are reasonably close as evident in fig. 6 and fig. 7. 
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Table 2: Timing analysis for different algorithms (time taken in seconds) for execution in MATLAB 
for a 3.2 GHz PC with 2 GB RAM. 



Algo 


Image 


AW 


CAP 


TA 




Size 










256 X 256 


11.21 


0.17 


0.06 


Coltuc 


512 X 512 


31.32 


0.35 


0.19 




1024 X 1024 


96.65 


0.43 


0.25 




256 X 256 


37.23 


0.21 


0.14 


Tian 


512 X 512 


99.32 


0.55 


0.31 




1024 X 1024 


172.12 


0.73 


0.38 



5.2.2 Tian 

Tian's algorithm requires a compression stage, and correspondingly we estimate the compression, 
using entropy based method discussed in section 3.2. In order to achieve higher embedding, typically 
the flag bits are also compressed. Further we assume that the bitstreams are compressed using 
arithmetic coding. As evident from fig. 6 and fig. 7, the estimates provided for Tian's schemes are 
not as precise as the ones provided by coltuc, mainly due to our approximation of the compression 
of the flag and location map bit streams. Nevertheless, except for the extreme probabilities, the 
estimates for most watermark bit probabilities are quite precise and useful. Notwithstanding this, 
it is also evident that CAP estimates are better than those of the TA algorithm. 

5.3 Comparative Analysis 

The proposed algorithms not only provide useful estimates of embedding capacity but are also 
extremely fast. The CAP algorithm is a more general algorithm since it is easily amenable to 
updating probabilities at every stage, and takes account of the contribution of the auxiliary bits. 
The tree based algorithm is however not amenable to updating probabilities at every iteration, but 
is extremely fast. Further note that the timings given in Table-2 for actual watermarking (AW), are 
the ones obtained by the most efficient look-up-table based implementation of Coltuc and Tian's 
algorithm. It is evident that our algorithms are computationally much more efficient than these 
low cost implementations. In particular, the tree based algorithm (TA) is extremely fast, nearly 
200 to 400 times faster than actual watermarking, while the CAP algorithm, though iterative 
is also reasonably fast, and is about 100 times faster than the actual watermarking. The main 
reason for this is the computational efficiency gained through the sparse representation of the co- 
occurrence matrix and the simple computations of our algorithms, vis-a-vis the complexity of actual 
watermarking due to the numerous iterations of collecting, embeding and possibly compressing the 
auxiliary data along with the watermark. Also as evident from the timings, Tian's algorithm is 
computationally expensive since it involves a data compression stage. 
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6 Conclusion 



This is mainly a theoretical paper, centered around multipass embedding capacity estimation for 
mapping and expansion based algorithms. We provide very general purpose algorithms, which 
can be applied to any watermarking scheme, operating on blocks of pixels. In particular, we 
implemented a co-occurrence based and a tree based algorithm, to efficiently estimate the capacity, 
at computational costs substantially lesser than actually embedding the watermark. We further 
show in Appendix-A, that both of these algorithms provide the same estimates. Prom our results, it 
is evident that the tree based method is computationally more efficient compared to co-occurrence 
based one since it requires only a single iteration. The co-occurrence based method on the other 
hand is however a more general framework and can be extended to consider different watermark 
probabilities at every iteration and correspondingly can be used to provide accurate estimates by 
considering the probabilities of the auxiliary bits as well. 
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A Appendix 

Theorem 3. Under multi-pass embedding, the embedding capacity estimated by the co-occurrence 
based algorithm (algorithm-1) is exactly the same as the one estimated by the pixel-pair tree based 
algorithm. In particular we prove that: 
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k=o ^eD seSf ^ k=o ^ fc=o 

Proof. For every pixel pair ^, let represent the set of nodes corresponding to the pixel pair 
tree of ^. Let n G refer to a node, i.e a pixel pair in the pixel-pair tree of ^ . Further let Sn 

represent any path of the tree, containing the node n and let dn represent the depth of node n. 
Hence Sn[dn] = n. Now we prove the first part of the theorem. Wc start with the LHS and from 
algorithm 1, it is evident that for any node n G with depth dn = k, the number of pixel pairs 
contributed by the pair ^, to the pixel pair n is C(^) nm=o^'sn[m]' '^^i^ clear from the updation 
step in algorithm 1. We then break down the sum X^^gjj C'fc(^)6^, into contributions from every 
node n G with depth dn = k, from every pixel pair in ^ G B. In other words we can write: 

k 

YCkm = Y E coio u PsnimMk] (42) 

^SD ^eDneN^:dn=k m=0 

Now it is obvious from the definition that: J2neN^:dn=kUrn=oPsn[m]bs4k] = E.eSj U.m=oPs[m]bs[k]- 
Hence using this in equation (42) we can prove the first part of the theorem. 

We now consider the second part. Using the result from earlier section we can write: 



p-1 p-1 k 



YY^km = EE E coio U psn[m]bs 

k=0 ien ^eO k=0 neTSi^:dn=k m=0 

dn 

= E E '^o(^) n^^"W^^"Ki 

^eD neNj k=0 

In the above expression, observe that we are summing over every node in the pixel pair tree as k 
goes from to P — 1. Hence we re- write the expression, by explicitly summing over every node in 

the pixel pair tree. We finally replace the variable m by k, to obtain the above expression. In order 
to prove the second part of the theorem, it is sufficient to show that for every pixel pair ^: 

d„ P-1 P-1 

E ^oio -Ylpsnikibn = Yl ^oio n p^ii^] E 

neNf k=0 seSj k=0 k=0 

Note that we used the fact that = bn- We then start with the R.H.S as: 

p-1 p-1 

R.H.S = 5]Co(On^'#]E^^W 
seSf fe=o k=o 

p-1 p-1 
= E E ^0*^^) n E bs[k]I{4k] = n) 

nGNj seSj k=0 k=0 

P-1 

= E Y^o(^^IlPs[k]bnI{s[k]=n) 

nSNj seS{ k=0 
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In the above equation, we consider only those paths which pass through the node n. Let S„ be a 
subset of paths which pass through node n. Hence we can write: 



dn P 



R.H.S = ^ ^Co{C)Y{ps[k] n Pslk]bv 

neNj seS„ k=0 k=d„+l 

dn P 

neNj fe=0 seSn k=d„+l 

dn 

= Yl (^0iOYlPsn[k]bn 
neNf fe=0 

= L.H.S 

Observe that the nodes in the paths s G S„ with depth greater than d„, form a complete subtree 
and hence we have: X^^ggn nr=i„+iPs[fc] ~ Further also observe that the nodes with a depth 
< dn occur in every path in s G S„. Hence we replace these nodes by s„[/c], since s„ is a path in 
S^. Hence proved. Note that in this proof though we have proved the equivalence of the tree based 
and co-occurrence based algorithms for the estimates of oj{B) and uj{Bk), the same proof can be 
used to prove the equivalence of the estimates of r]{B) and r]{Bk)- □ 

Lemma 1. For any function / : S — >■ [0.5,1], min Ho(f(s)) = i?o(max/(s)). Similarly for any 

seS seS 

function g :S ^ [0,0.5], min Ho{g{s)) = iJo(min ^(s)). 

Proof. First note that Hq{z) is a decreasing function for 0.5 < z < 1. Hence Hq{zi) < Hq[z2) 
implies zi > zi- Using this fact, we now prove this lemma by contradiction. Assume that si = 

arg min Ho{f{s)) and S2 = arg max /(s). Further assume that /(si) / /(s2)- Then clearly 

seS seS 
by definition /(si) < /(S2) and i7o(/(si)) < i^o(/(s2))- This is a contradiction since Hq{z) is a 
decreasing function. Thus /(si) = /(S2) min Ho{f{s)) = i7o(niax/(s)). Similarly we can prove 

the second part. □ 
Lemma 2. For any function ^ : S ^ [0, 1], min Ho{h{s)) > min(iJo(max /i(s)), iJo(min h{s))). 

Proof. This lemma directly follows from Lemma 1. If h{s) < 0.5. then using lemma 1, wc have: 
min HQ{h{s)) = i7o(min h{s)). Similarly if h{s) > 0.5, min HQ{h{s)) = Hoiuiax. h{s)). Hence 

min i?o(^(s)) is definitely greater than the minimum of these two. Hence proved. □ 
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